Who Let the CAT Out of the Bag? Accurately Dealing with Substitutional Heterogeneity in Phylogenomic Analyses.

Authors

  • Nathan V Whelan
  • Kenneth M Halanych
Abstract

As phylogenetic datasets have increased in size, site-heterogeneous substitution models such as CAT-F81 and CAT-GTR have been advocated over other models because they purportedly suppress long-branch attraction (LBA). These are two of the most commonly used models in phylogenomics, and they have been applied to a variety of taxa, ranging from Drosophila to land plants. However, many arguments in favor of CAT models have been based on tenuous assumptions about the true phylogeny, rather than rigorous testing with known trees via simulation. Moreover, CAT models have not been compared to other approaches for handling substitutional heterogeneity, such as data partitioning with site-homogeneous substitution models. We simulated amino acid sequence datasets with substitutional heterogeneity on a variety of tree shapes, including those susceptible to LBA. Data were analyzed with both CAT models and partitioning to explore model performance; in total, over 670,000 CPU hours were used, of which over 97% was spent running analyses with CAT models. In many cases, all models recovered branching patterns that were identical to the known tree. However, CAT-F81 consistently performed worse than other models in inferring the correct branching patterns, and both CAT models often overestimated substitutional heterogeneity. Additionally, reanalysis of two empirical metazoan datasets supports the notion that CAT-F81 tends to recover less accurate trees than data partitioning and CAT-GTR. Given these results, we conclude that partitioning and CAT-GTR perform similarly in recovering accurate branching patterns. However, computation time can be orders of magnitude less for data partitioning, with commonly used implementations of CAT-GTR often failing to reach completion in a reasonable time frame (i.e., for Bayesian analyses to converge). Practices such as removing constant sites and parsimony-uninformative characters, or using CAT-F81 when CAT-GTR is deemed too computationally expensive, cannot be logically justified. Given clear problems with CAT-F81, phylogenies previously inferred with this model should be reassessed. [Data partitioning; phylogenomics; simulation; site-heterogeneity; substitution models.]




Journal:
  • Systematic Biology

Volume 66, Issue 2

Pages: -

Publication date: 2017